1,043 research outputs found

Towards the Teraflop CFD

Author: Schreiber Robert
Simon Horst D.

We are surveying current projects in the area of parallel supercomputers. The machines considered here will become commercially available in the 1990 - 1992 time frame. All are suitable for exploring the critical issues in applying parallel processors to large scale scientific computations, in particular CFD calculations. This chapter presents an overview of the surveyed machines, and a detailed analysis of the various architectural and technology approaches taken. Particular emphasis is placed on the feasibility of a Teraflops capability following the paths proposed by various developers

NASA Technical Reports Server

Spectral ordering techniques for incomplete LU preconditioners for CG methods

Author: Clift Simon S.
Simon Horst D.
Tang Wei-Pai

The effectiveness of an incomplete LU (ILU) factorization as a preconditioner for the conjugate gradient method can be highly dependent on the ordering of the matrix rows during its creation. Detailed justification for two heuristics commonly used in matrix ordering for anisotropic problems is given. The bandwidth-reduction and weak-connection-following heuristics are implemented through an ordering method based on eigenvector computations. This spectral ordering is shown to be a good representation of the heuristics. Analysis and test cases in two- and three-dimensional diffusion problems demonstrate when ordering is important, and when an ILU decomposition will be ordering insensitive. The applicability of the heuristics is thus evaluated and placed on a more rigorous footing

NASA Technical Reports Server

HARP: A Dynamic Inertial Spectral Partitioner

Author: Biswas Rupak
Simon Horst D.
Sohn Andrew

Partitioning unstructured graphs is central to the parallel solution of computational science and engineering problems. Spectral partitioners, such as recursive spectral bisection (RSB), have proven effective in generating high-quality partitions of realistically-sized meshes. The major problem which hindered their widespread use was their long execution times. This paper presents a new inertial spectral partitioner, called HARP. The main objective of the proposed approach is to quickly partition the meshes at runtime in a manner that works efficiently for real applications in the context of distributed-memory machines. The underlying principle of HARP is to find the eigenvectors of the unpartitioned vertices and then project them onto the eigenvectors of the original mesh. Results for various meshes ranging in size from 1000 to 100,000 vertices indicate that HARP can indeed partition meshes rapidly at runtime. Experimental results show that our largest mesh can be partitioned sequentially in only a few seconds on an SP2, which is several times faster than other spectral partitioners while maintaining the solution quality of the proven RSB method. A parallel MPI version of HARP has also been implemented on IBM SP2 and Cray T3E. Parallel HARP, running on 64 processors of the SP2 and T3E, can partition a mesh containing more than 100,000 vertices into 64 subgrids in about half a second. These results indicate that graph partitioning can now be truly embedded in dynamically-changing real-world applications

NASA Technical Reports Server
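
The spectral bisection idea mentioned in the abstract above can be illustrated with a minimal sketch. This is plain spectral bisection on a toy graph, not HARP's inertial method or the authors' code: it approximates the Fiedler vector (the eigenvector of the second-smallest eigenvalue of the graph Laplacian) by shifted power iteration and splits the vertices at the median. The function names, the iteration count, and the six-vertex example graph are assumptions made for illustration, and the sketch assumes a connected graph.

```python
import math

def fiedler_vector(adj, iters=2000):
    """Approximate the Fiedler vector by power iteration on (c*I - L),
    projecting out the constant vector each step. Assumes a connected graph."""
    n = len(adj)
    deg = [len(adj[v]) for v in range(n)]
    c = 2 * max(deg) + 1          # shift: Laplacian eigenvalues lie in [0, 2*max_deg]
    x = [float(i) for i in range(n)]  # deterministic, non-constant start
    for _ in range(iters):
        mean = sum(x) / n
        x = [xi - mean for xi in x]   # remove the constant (eigenvalue 0) component
        # y = (c*I - L) x, where L = D - A
        y = [(c - deg[v]) * x[v] + sum(x[u] for u in adj[v]) for v in range(n)]
        norm = math.sqrt(sum(yi * yi for yi in y))
        x = [yi / norm for yi in y]
    return x

def spectral_bisect(adj):
    """Sort vertices by Fiedler value and split at the median."""
    f = fiedler_vector(adj)
    order = sorted(range(len(adj)), key=lambda v: f[v])
    half = len(adj) // 2
    return set(order[:half]), set(order[half:])

def cut_size(adj, part):
    """Number of edges with exactly one endpoint in `part`."""
    return sum(1 for v in part for u in adj[v] if u not in part)

# Hypothetical example: two triangles joined by the single edge 2-3.
adj = [[1, 2], [0, 2], [0, 1, 3], [2, 4, 5], [3, 5], [3, 4]]
left, right = spectral_bisect(adj)
print(sorted(left), sorted(right), cut_size(adj, left))
```

The sorted vertex order computed inside `spectral_bisect` is also the kind of eigenvector-based ordering that spectral ordering methods use; recursive bisection would apply the same split again to each half.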

Impact of Load Balancing on Unstructured Adaptive Grid Computations for Distributed-Memory Multiprocessors

Author: Biswas Rupak
Simon Horst D.
Sohn Andrew

The computational requirements for an adaptive solution of unsteady problems change as the simulation progresses. This causes workload imbalance among processors on a parallel machine which, in turn, requires significant data movement at runtime. We present a new dynamic load-balancing framework, called JOVE, that balances the workload across all processors with a global view. Whenever the computational mesh is adapted, JOVE is activated to eliminate the load imbalance. JOVE has been implemented on an IBM SP2 distributed-memory machine in MPI for portability. Experimental results for two model meshes demonstrate that mesh adaption with load balancing gives more than a sixfold improvement over one without load balancing. We also show that JOVE gives a 24-fold speedup on 64 processors compared to sequential execution

NASA Technical Reports Server

09061 Abstracts Collection -- Combinatorial Scientific Computing

Author: Naumann Uwe
Schenk Olaf
Simon Horst D
Toledo Sivan
Publication venue: Dagstuhl Seminar Proceedings. 09061 - Combinatorial Scientific Computing
Publication date: 01/01/2009

From 01.02.2009 to 06.02.2009, the Dagstuhl Seminar 09061 "Combinatorial Scientific Computing" was held in Schloss Dagstuhl -- Leibniz Center for Informatics. During the seminar, several participants presented their current research, and ongoing work and open problems were discussed. Abstracts of the presentations given during the seminar as well as abstracts of seminar results and ideas are put together in this paper. The first section describes the seminar topics and goals in general. Links to extended abstracts or full papers are provided, if available

Dagstuhl Research Online Publication Server

Supporting National User Communities at NERSC and NCAR

Author: Killeen Timothy L.
Simon Horst D.
Publication venue: Lawrence Berkeley National Laboratory
Publication date: 16/05/2006

The National Energy Research Scientific Computing Center (NERSC) and the National Center for Atmospheric Research (NCAR) are two computing centers that have traditionally supported large national user communities. Both centers have developed responsive approaches to support these user communities and their changing needs, providing end-to-end computing solutions. In this report we provide a short overview of the strategies used at our centers in supporting our scientific users, with an emphasis on some examples of effective programs and future needs

eScholarship - University of California

UNT Digital Library

On the Equivalence of Nonnegative Matrix Factorization and K-means - Spectral Clustering

Author: Ding Chris
He Xiaofeng
Jin Rong
Simon Horst D.
Publication venue: Lawrence Berkeley National Laboratory
Publication date: 04/12/2005

We provide a systematic analysis of nonnegative matrix factorization (NMF) relating to data clustering. We generalize the usual X = FG^T decomposition to the symmetric W = HH^T and W = HSH^T decompositions. We show that (1) W = HH^T is equivalent to Kernel K-means clustering and the Laplacian-based spectral clustering. (2) X = FG^T is equivalent to simultaneous clustering of rows and columns of a bipartite graph. We emphasize the importance of orthogonality in NMF and the soft clustering nature of NMF. These results are verified with experiments on face images and newsgroups

UNT Digital Library
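
As a rough illustration of the X = FG^T decomposition and its soft-clustering reading in the abstract above, here is a minimal pure-Python sketch. It uses the standard multiplicative update rules for the squared-error objective, which is an assumption: the abstract does not say which algorithm the authors use. The helper names, the random initialization, and the small block-structured matrix are likewise hypothetical.

```python
import random

def matmul(A, B):
    """Dense matrix product of two list-of-lists matrices."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def transpose(A):
    return [list(col) for col in zip(*A)]

def frob_error(X, F, G):
    """Squared Frobenius norm of X - F G^T."""
    R = matmul(F, transpose(G))
    return sum((x - r) ** 2 for xr, rr in zip(X, R) for x, r in zip(xr, rr))

def nmf(X, k, iters=200, seed=0, eps=1e-12):
    """Factor a nonnegative n x m matrix X as F G^T with F (n x k), G (m x k) >= 0."""
    rng = random.Random(seed)
    n, m = len(X), len(X[0])
    F = [[rng.uniform(0.1, 1.0) for _ in range(k)] for _ in range(n)]
    G = [[rng.uniform(0.1, 1.0) for _ in range(k)] for _ in range(m)]
    Xt = transpose(X)
    for _ in range(iters):
        # F <- F * (X G) / (F G^T G), elementwise
        num = matmul(X, G)
        den = matmul(F, matmul(transpose(G), G))
        F = [[F[i][j] * num[i][j] / (den[i][j] + eps) for j in range(k)] for i in range(n)]
        # G <- G * (X^T F) / (G F^T F), elementwise
        num = matmul(Xt, F)
        den = matmul(G, matmul(transpose(F), F))
        G = [[G[i][j] * num[i][j] / (den[i][j] + eps) for j in range(k)] for i in range(m)]
    return F, G

# Hypothetical data: columns 0-1 and columns 2-3 form two obvious groups.
X = [[1, 1, 0, 0],
     [1, 1, 0, 0],
     [0, 0, 1, 1],
     [0, 0, 1, 1]]
F, G = nmf(X, k=2)
# Each row of G holds soft membership weights for one column of X;
# taking the largest entry turns them into a hard cluster label.
labels = [max(range(2), key=lambda j: row[j]) for row in G]
print(labels, frob_error(X, F, G))
```

The rows of G are not constrained to be orthogonal here, so the memberships stay soft; the abstract's point about orthogonality is what would push such a factorization toward a hard, K-means-like assignment.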

Computational Biology and High Performance Computing 2000

Author: Simon Horst D.
Zorn Manfred D.
Spengler Sylvia J.
Shoichet Brian K.
Stewart Craig A.
Dubchak Inna L.
Arkin Adam P.
Publication date: 01/10/2000

Tutorial to be presented at Supercomputing 2000, Dallas TX, 6-10 November 2000. This work was supported by the Director, Office of Science, Office of Advanced Scientific Computing Research, Mathematical, Information, and Computational Sciences Division of the U.S. Department of Energy under Contract No. DE-AC03-76SF0009

University of Alabama Libraries: Acumen

IUScholarWorks (University of Indiana)

Computational Biology and High Performance Computing 2000

Author: Arkin Adam P.
Dubchak Inna L.
Shoichet Brian K.
Simon Horst D.
Spengler Sylvia J.
Stewart Craig A.
Zorn Manfred D.
Publication date: 01/10/2000

IUScholarWorks (University of Indiana)